Introduction

Bioacoustic techniques, involving the recording of animal sounds, have a long history in ecological study. However, technological innovations in the last 10–20 years have greatly increased the potential of this approach for research in a wide range of habitats and for a variety of taxa (Sueur et al. 2008; Marques et al. 2013). One particularly valuable area for development is the survey of bird assemblages or populations, including rare, cryptic and disturbance-prone species such as the Corncrake (Crex crex L.), Bittern (Botaurus stellaris L.), Nightjar (Caprimulgus europaeus L.) and Western Capercaillie (Tetrao urogallus L.) (Abrahams and Denny 2018). Automated static recording units (often called ‘passive’ or ‘autonomous’ recorders) are especially well suited to point count-type surveys for highly territorial and lekking species, where systematic spatial and temporal sampling can help determine occupancy, species composition or population size (Brandes 2008; Gasc et al. 2017; Shonfield and Bayne 2017).

Western Capercaillie (hereafter ‘Capercaillie’) is a rare and declining bird species in the UK, with a small remnant population in forest habitats in northeast Scotland (Wilkinson et al. 2018). The species has an ‘exploded’ lek mating system, where males display over a dispersed area to indicate their breeding condition—the numbers of birds attending dependent on the quality and amount of the surrounding old forest habitat (Hjorth 1970; Picozzi et al. 1992; Laiolo et al. 2011; Wegge et al. 2013). Alongside other methods (e.g. Jacob et al. 2010), counts of displaying males at leks in spring are used to assess breeding population status and abundance for Capercaillie, for regional/national monitoring programmes (Picozzi et al. 1992; Pollo et al. 2005; Summers et al. 2010). In Scotland, these showed a decline in numbers of 29% between 2004 and 2010 (Ewing et al. 2012), although figures from 2010 to 2016 showed modest between-year fluctuations, with no significant overall trend (Wilkinson et al. 2018).

Although lek counting is one of the few ways to gain regular population data on this elusive species, there are a number of recognized problems with this method. Firstly, Capercaillies are known to be susceptible to human disturbance (Ewing et al. 2012; Mollet et al. 2015), and regular impacts due to traditional counts could potentially have a negative effect on local populations. Surveys normally attempt to limit this disturbance by using trained surveyors and employing hides while at the lek site (Haysom 2013). Secondly, the remoteness of many lek sites, combined with few suitably experienced surveyors, produces practical constraints on surveys, limiting their spatial and temporal coverage. With few monitoring visits taking place, the quality of data from lek counts may be affected by differences in detection probabilities between sites or survey events, or by measurement and identification errors (Simons et al. 2009; Celis-Murillo et al. 2009). Finally, Capercaillie behaviour may affect lek attendance, and make the interpretation of count data difficult. Lek attendance is age dependent, with males aged > 2 years defending territories close to the lek centre, while younger males establish peripheral territories or do not display territorial/lekking behaviour (Mollet et al. 2015). Time of day and date in the season can also impact lek-attendance patterns, and hence, lek counts. Such sources of variation need to be considered when assessing count data, and the detection probability of birds estimated, protocols standardized, and lek counts adjusted to properly estimate populations (Walsh et al. 2004).

The use of acoustic recording, alongside existing survey methods, could reduce the recognized biases outlined above, enabling the improved monitoring of Capercaillie lek sites and providing more accurate population estimates (Laiolo et al. 2011; Oppel et al. 2014). This study used automated recording units for monitoring leks and aimed to determine how data from bioacoustic recording compares to the traditional human lek count method. The objectives of the study were: (1) to compare methods of population assessment, by comparing counts of males from traditional surveys with levels of recorded call activity from the same leks; and (2) to determine whether the levels of call activity from different lek sites can be related to environmental variables, such as weather conditions.

Methods

Acoustic field recording

Ten Wildlife Acoustics (www.wildlifeacoustics.com) SongMeter SM2 acoustic recorders were placed at known Capercaillie lek sites near Aviemore, Scotland (57.19°N, 3.82°W). A further two were also placed, but failed to record correctly, and were excluded from the analysis. The lek sites were commonly 3–5 km apart, with the furthest points at 27.5-km distance and encompassing an area of 209 km2. The recorders were positioned on 4–5 April 2017, but programmed to start recording on 10 April 2017. Recorders were collected on completion of the 10 May 2017 recording session, after 31 days of operation.

Each recorder was programmed to record in mono, with one Wildlife Acoustics SMX-II omnidirectional microphone mounted on the recording unit. The recorders were placed in the vicinity of the normal lek count hide locations, with the microphones oriented horizontally towards the assumed lek site ‘centre’. The recorders were attached to trees at approximately 1.5 m off the ground and Global Positioning System (GPS) coordinates taken.

The recorders were programmed to operate daily, from 1.5 h before sunrise until 1.5 h after sunrise. This resulted in a recording period from 0451–0751 hours on 10 April 2017 to 0326–0626 hours on 10 May 2017. Recording was limited to these times based on previous experience (Abrahams and Denny 2018), standard lek count practice (Haysom 2013) and surveyor advice (S. West, personal communication), whilst saving the limited battery life and data storage capacity.

Data was recorded in 10-min-duration full-spectrum data files in Waveform Audio File (.wav) format on SD cards at a sampling rate of 8000 Hz and 16 bits/sample. This resulted in recordings covering the frequency range up to 4000 Hz, sufficient to record Capercaillie vocalizations, but exclude higher frequency bird song and other sounds. All recordings had the date, time and recorder site reference appended as metadata.

Lek count

Lek counts took place between 16 April and 1 May 2017, during the period when recorders were placed on site. Hides were set up at the lek sites on the day before the count, and then surveyors entered the hides during the evening. The following morning, surveyors recorded Capercaillie activity and attempted to determine the number of male birds displaying (Haysom 2013). All sites were visited twice during the survey window, except for site 8528, which was only visited once. Survey visits were separated by 1–12 days (mean of 5 days). At two locations, two separate hides were used (giving separate counts), as the lek areas were too large to cover with one hide only. At these two locations, recorders were placed to match the hides (i.e. with recorders 8552 and 9306 at one location, and 8535 and 8607 at another). Hence these two locations are effectively treated as each representing two separate lek sites. This may not be the case ecologically, but it is the case practically, in terms of both the lek counts and using acoustic recorders. The recorders were sufficiently spaced to avoid any double counting in the acoustic data, with separation distances of 260 m between sites 8535 and 8607, and 85 m between sites 8552 and 9306 (Venier et al. 2012; Sedláček et al. 2015; Abrahams and Denny 2018). Site names and details are not given here due to landowner confidentiality.

Call analysis

Data sets

The audio recordings taken from the field consisted of 5580 .wav files (49.9 GB of data). This dataset was analysed using a semi-automated system to identify Capercaillie vocalizations in the recordings, with a three-stage process to produce a Capercaillie call ‘recognizer’ and final output. An additional dataset was used for recognizer training, consisting of recordings collected in the same area of Scotland in 2016 (Abrahams and Denny 2018). This 2016 dataset consisted of 1586 .wav files (28.2 GB data) collected over 14 days at three lek sites. It was originally recorded at 12-kHz sample rate, and so was downsampled to 8 kHz to match the 2017 recordings.

Data were analysed using Kaleidoscope Pro 4.3.2 software (Wildlife Acoustics 2017), using its cluster analysis method. This process uses hidden Markov models to search for repeated phrases in the recordings (e.g. the song of a particular bird species), and groups these into a number of clusters based on their similarity. This study represents one of the first published tests of this software, filling a recently identified gap in the literature (Knight et al. 2017; Priyadarshani et al. 2018).

Training data recognizer development

The Kaleidoscope software was used to process the 2016 training dataset with unsupervised clustering, default settings and a 1–3 kHz frequency signal of interest (see “Appendix” section). This preliminary analysis scanned the raw recordings to identify individual phrase segments within the training data, each of these being 1.0–6.9 s in duration (the typical song length of Capercaillie), and comprising a sequence of syllables with a maximum inter-syllable gap of 1 s. These segments were then grouped by the software into clusters of similar sound characteristics (e.g. different species or song types), and the results saved.

All of the phrase segments from the 2016 training data were manually reviewed by a visual check of spectrograms and by listening to playback, and then tagged as ‘CP’ if Capercaillie vocalizations were present, or ‘NOTCP’ if Capercaillie vocalizations were not present (when the segment was a record of sounds of other bird species or other noise). Each phrase segment could include vocalizations by more than one bird species, if these were singing simultaneously within the target frequency band, but they were assigned as CP if calls from Capercaillie were present. The clustering process enabled a rapid review of the segments, as similar vocalizations were grouped together.

After all phrases had been tagged, a second pass classification was carried out on the training data by re-scanning the manually tagged recordings. This second run used the identification tags to create a pairwise CP/NOTCP recognizer file with which to analyse the 2017 data. No further manual check was done at this second stage, as the sole intention of the second pass is to use the manual identification tags entered into the first pass results to produce the recognizer file for future analysis.

Audio data analysis using call recognizer

The pairwise recognizer, produced from the 2016 training data, was used to analyse the 2017 data. Using the automated process within the Kaleidoscope software, this identified phrase segments in the data, using the same parameters as above, and assigned these vocalizations as either CP or NOTCP. The analysis output was a spreadsheet of call data, with one row per phrase segment (hyperlinked to the associated .wav file), providing information on the vocalization (e.g. duration and frequency), the recorder site, date and time, and whether it was assigned to CP or NOTCP.

Metrics were calculated to report the quality of the recognizer process in detecting vocalizations of the target species. The proportions of false positives and false negatives within the output dataset were estimated by manual review of a random sub-sample of the clustered phrase segments, and precision, recall, and F-score calculated (Knight et al. 2017; Chambert et al. 2018). Precision is the proportion of classifications that are true detections of the target species, while recall is the proportion of target species vocalizations correctly recognized. The F-score is a combined score, summarising precision and recall together. The best recognizer models should have both high precision and recall, indicating low levels of false positive and false negative identifications.

Environmental data

The altitudes of recorder locations were derived from Ordnance Survey maps, based upon the logged GPS locations. Daily weather data for the Met Office MIDAS station at Aviemore was accessed through the Centre for Environmental Data Analysis (www.ceda.ac.uk). This included the following parameters: precipitation amount (millimetres), maximum air temperature (degrees Celsius), minimum air temperature (degrees Celsius), grass temperature (degrees Celsius), concrete temperature (degrees Celsius), sun duration (hours), wind speed (miles per hour) and maximum gust speed (miles per hour). The altitude and weather parameters were all tested separately for relationships with the daily levels of call activity, using Spearman’s rank correlation. All data visualization and statistical tests were carried out using R and R Studio software (R Core Team 2013; RStudio Team 2015).

Results

Lek counts

The lek counts recorded between zero and seven males at each site (Table 1), with the maximum of seven males at two sites: 8621 and 9319. No males were recorded at sites 8535 and 8528, but only one count took place at the latter site. The number of males counted in repeat surveys stayed the same at only one site, and mostly varied by one or two individuals between visits.

Table 1 Lek count data at each recorder site (with date of lek count survey)

Phrase segment classification

Analysis of the 2016 training data produced 20,493 phrase segments, grouped into 12 clusters. A total of 2114 segments (10.3%) were manually assigned as CP, with the remaining 18,379 segments (89.7%) tagged as NOTCP (Table 2).

Table 2 Phrase segment identifications at each analysis stage

A total of 86,624 phrase segments were obtained from the 2017 analysis data. Of these, 23,688 (27%) were matched by the software recognizer to CP, and 62,936 (73%) to NOTCP (Table 2). A random sample of 3755 phrase segments was manually reviewed, and 1146 (31%) identified as CP, with 2609 (69%) identified as NOTCP (see Table 3). These manually identified segments were individually compared to the automated recognizer classifications, with 2952 segments (79%) correctly classified by the recognizer to either CP or NOTCP. There were 803 (21%) false identifications, comprising 583 (15%) false positives, and 220 (6%) false negatives. These values mean that the recognizer classification had a precision of 0.61, recall of 0.81 and F-score (β = 1) of 0.70.

Table 3 Automated vs. manual identification error matrix, indicating the numbers of phrase segments manually reviewed and their match or mismatch with the automated recognizer identifications; for abbreviations, see Table  2

Call rate at each site

The mean number of automated CP phrase segments recorded over the month was 2369 per site (76/day per site) (Table 4). The highest levels of vocalization were recorded at site 8552, with 4114 CP segments recorded over the month (mean 133/day). The smallest number of segments was at site 8607, with only 794 (mean 26/day). Sites 8528 and 8535 had no males recorded in lek counts, but respectively had 2580 and 881 CP phrase segments. However, manual review confirmed that these were likely to be nearly all false positives. The proportion of manually identified CP phrase segments at each recorder site, as identified in the sub-sample of 3755 segments, varied between 0 and 90% (Table 4). This confirmed occupancy at all sites, apart from site 8528. A secondary check of all phrase segments for this particular recorder site confirmed that all CP phrase segments were false positives, despite the relatively high number of CP segments identified by the automated recognizer.

Table 4 Number of CP phrase segments identified by automated recognizer and manually confirmed phrase segments at each recorder site from a sub-sample of 3755 segments

To compensate for the incorrect classifications from the automated recognizer, the number of automated CP phrase segments for each recorder site was multiplied by its manually identified CP percentage to produce a simple CP call index. This is shown in the final column of Table 4. It is proposed that this index provides a more accurate assessment of Capercaillie vocal activity at each recorder site than the uncorrected number of CP phrase segments.

The primary aim of this study was to compare the results from lek counts and bioacoustics monitoring. A significant positive relationship was found between the numbers of males recorded by lek counts and the CP call index (S = 223.52, ρ = 0.804, p < 0.01) (Fig. 1). So, it appears that the greater the number of males, the larger the number of vocalizations over the course of a month. In a shorter time frame, there was no significant relationship between the number of males and the number of automated CP segments recorded on the day of the lek count survey (S = 738, p = 0.14).

Fig. 1
figure 1

Capercaillie vocalizations present (CP) call index is positively correlated to the number of males recorded in lek counts. Spearman’s rank correlation test results: S = 223.52, ρ = 0.804, p < 0.01

One notable feature of the results is the differences between the ‘paired’ recorders at sites 8552/9306 and 8535/8607. The lek count results within these pairs differed, with 5–6/2 males and 0/1–3 males respectively. The CP call index and number of CP phrase segments differed substantially between the first pair of sites (where recorders were only 85 m apart), but matched closely at the second pair (Table 4, Fig. 1).

Call rate in relation to date/time

The CP call index and number of automatically identified CP phrase segments recorded per day varied in total across all locations (Fig. 2), and at each recorder site (Fig. 3). The total number of CP segments recorded ranged from 362 on 25 April, to 1229 on 9 May, with CP call index tracking this closely, except towards the end of the survey period. The highest numbers of CP segments, recorded during May, were presumably due to increasing dawn chorus activity from bird species other than Capercaillie.

Fig. 2
figure 2

CP call index and number of automated CP phrase segments totalled across all sites showed substantial daily variation, but no obvious trend through the survey period

Fig. 3
figure 3

CP call index varied widely between dates and locations. Plots show the CP call index each day at each recorder site. Lek count survey dates are shown by triangles along the baseline

There were relatively continuous, but dynamic levels of daily activity at each recorder site where Capercaillie were present. The CP index varied daily (especially at site 9348), but with no overall trend through the survey period. The median date of CP segments varied between recorder sites. Sites 9306 and 9348 were ‘early season’ sites, with median dates of 22 April, while 8535 and 8607 (one of the sets of paired sites), which were the ‘latest’ in the season with median dates in May, were the least vocally active and showed high levels of false positives in their data.

A broad daily peak in call activity was recorded in the hour before sunrise. However, there was variation between recorder sites, with sites 9319 and 9558 showing a peak earlier in the morning, and 8621 and 9306 showing a peak closer to sunrise (Fig. 4).

Fig. 4
figure 4

Timing of Capercaillie call activity varied between recorder sites. Plots show the CP call index in 10-min periods related to sunrise at each recorder site

Call rates in relation to environmental factors

The weather variables were tested for correlation with both the number of CP phrase segments and the CP call index, but no significant relationships were found.

The altitude of the lek sites varied between 200 and 390 m above ordnance datum. A significant negative relationship was found between altitude and the CP call index using Spearman’s rank correlation (S = 1845.5, ρ = − 0.619, p < 0.01). Although this is mainly due to the levels of vocal activity of the birds present, it also appears to be partly attributable to the number of males in attendance at the lek, which is also negatively related to altitude (S = 1662.8, ρ = − 0.458, p = 0.048).

Call rates during lek counts

The data in Fig. 3 suggested that the CP call index and number of CP phrase segments may be lower on lek count days than on average, which possibly shows disturbance by surveyors on these occasions. This hypothesis was tested using Mann–Whitney U-test and a significant difference was found between the number of CP phrase segments on lek count days in comparison to the days when no surveyor count was conducted (U = 1482.5, p < 0.001). Lower vocal activity was recorded on lek count days (Fig. 5), with the median number of phrase segments decreasing from 62 to 23 per day.

Fig. 5
figure 5

Number of CP phrase segments was significantly lower than average on the days when lek counts took place. Mann–Whitney U-test of survey against non-survey days U = 1482.5, p < 0.001

Discussion

Acoustic data analysis

Autonomous recorders are being increasingly used to determine the presence/absence and abundance of bird species (Shonfield and Bayne 2017). Recorders can be left in the field for long periods to acquire significant quantities of data, but manual processing by human observers can then be labour intensive. To counteract this, automated methods are being developed, either to recognize species vocalizations outright or to classify and group recordings, to make manual checks more efficient (Priyadarshani et al. 2018). Due to the recent release of the Kaleidoscope Pro software employed in this study, tests of its use and validity are rare in the literature.

Similar analytical methods to those used here have been undertaken in previous studies on avian acoustics (e.g. Machado et al. 2017). A semi-automated process can substantially reduce the time required for analysis, but introduces the potential for machine errors in call recognition, with both false negative and false positive classifications likely to result. Due to this, a range of studies have compared the efficiency of manual identification with a variety of automated or semi-automated methods. These have found that automated methods require less time for data processing, but sometimes identify fewer target sounds than manual analysis, and can make more false positive identifications (Swiston and Mennill 2009; Knight et al. 2017). The best approach is therefore likely to be an integrated approach with automated and manual methods combined, as used in the current study.

Recognizer metrics were calculated using manual identification of a subset of the automated data to assess the quality of the analysis (Knight et al. 2017; Chambert et al. 2018). Only 4% of the overall dataset was used for this assessment, but Chambert et al. (2018) state that quality assurance of automated recognizers can be achieved with a manual review of as little as 1% of the total data. The metrics indicated that the Kaleidoscope recognizer classification, as implemented here, had a precision of 0.61 and recall of 0.81. A number of previous studies reviewed by Knight et al. (2017) had a mean precision of 0.71 and mean recall of 0.60. The current study, therefore, had a precision lower than these reported cases, with a relatively high number of false positives. However, recall was high due to the low number of false negatives. In practical terms there is a benefit, in single-species studies like this one, from weighting analysis towards false positives, as the smaller dataset of ‘positive’ identifications is easier to manually review, instead of finding missed calls in the larger ‘negative’ dataset.

The moderate level of precision in this study means that the number of CP phrase segments is likely to be an overestimation of the actual Capercaillie calls in the data set (as presented in Table 4). This is especially clear in the data from site 8528, where all segments identified as CP were false positives. This potential inaccuracy was compensated for in this study by using the results from the manual review as a correction factor for the number of CP segments, to produce a CP call index that more accurately reflected actual Capercaillie vocal activity. As a consequence, taking into consideration the time and costs of analysis, an appropriate approach is believed to have been taken in this study to find a balance in terms of the accuracy required to generate useful data. Clearly the levels of false positive and false negative classifications could be reduced, if needed, with more detailed manual checking of the data.

Capercaillie call activity is related to lek count numbers

The primary objective of this study was to determine whether bioacoustics could be used to assess lek activity levels, and could be related to the numbers of displaying males present. It is established for some species that call rate can be used as a proxy for population abundance (Laiolo et al. 2011; Oppel et al. 2014; Knight et al. 2017), and that vocal display effort can increase in line with lek size or visitation rates to the lek (Westcott 1992; Pizo and Aleixo 1998; Cestari et al. 2016).

For Capercaillie, vocal characteristics often correlate with male quality, health condition, and competition levels—‘fitter’ males using lower frequencies, higher song rates and longer display durations. Displaying Capercaillie males from neighbouring grounds are also known to stimulate each other by increasing song rates, and song rate has been found to be significantly correlated to the number of displaying males in an area (Laiolo et al. 2011).

A significant positive relationship between Capercaillie lek count numbers and call activity was identified in this study, indicating that the CP call index and total number of CP phrase segments over an extended survey period (such as the month-long duration used here) may serve as an indicator of lek size. However, this does not appear to be true over shorter time frames, due to the considerable daily variation in call activity. This variation is likely the result of factors such as the date within the lekking season, and weather conditions affecting bird behaviour—factors that could also affect lek counts, and which are perhaps not considered fully within that method at the current time (Raynor et al. 2017; Fremgen et al. 2018). In addition, the mobility of Capercaillie around the lek site, with birds potentially displaying from different places on different mornings, may affect their detectability on a day-to-day basis, and adverse weather conditions (i.e. high winds and rain) could potentially mask calls in the recordings on some days.

The correlation found between lek numbers and call activity in this study indicates the potential for bioacoustics to be used alongside existing population monitoring methods for Capercaillie. Bioacoustics could be used effectively to calibrate the results gained from lek counts, especially when monitoring a large number of sites, conducting surveys at night, and covering a broad timescale, which are all difficult to achieve with many traditional survey methods (Furnas and Callas 2015; Gasc et al. 2017). This would add a new dimension to the vital understanding and flexibility of approach gained from human surveyors undertaking lek counts, with both methods being employed together to provide complementary information on lek activity (Venier et al. 2012; Shonfield and Bayne 2017).

Effects of lek count surveys on vocal activity

In this study, recorded vocal activity dropped to approximately 1/3 of average levels on days when surveyors visited the lek sites. As the most extreme examples of this, the lek counts at sites 9348 and 9558 both occurred on the 2 days when the lowest levels of call activity were recorded at these sites. This raises two potential issues. Firstly, it could indicate that surveyors caused sufficient disturbance to affect and reduce the vocal activity at the lek. This would be of concern, if true, as there could potentially be ecological effects of this on Capercaillie. The species is known to be susceptible to human disturbance, particularly at lek and brood-rearing times (Summers et al. 2009; Moss et al. 2014), and recreational impact studies have shown that disturbance of lekking birds may lead to local population effects, either by preventing display, avoidance of disturbed sites by hens, or preventing recruitment of young males (Marshall 2005). Studies have rarely been conducted on any bird species to determine whether the presence of surveyors has any significant impact on birds. One previous study, by Campbell and Francis (2012), used acoustic recording to assess point counts of passerines in old field habitat and found no difference in the results when an observer was present or absent. How this finding relates to other habitats or species assemblages is not known. Despite the lack of investigation into surveyor impacts on lek activity, the potential for this to occur in Capercaillie has been fully recognized, with precautions to avoid disturbance recommended as part of published survey methods (e.g. Haysom 2013). In this study, the overall correlation between Capercaillie vocal activity and lek counts suggests that there are likely no longer-term effects over the lek period. In addition, the observations of normal behaviour in the field during lek count visits gave no indication of other apparent adverse impacts.

A second potential explanation for the reduced call activity on count days is that it is just a chance effect, and that no disturbance is caused by lek counting. However, this raises the question that if call activity (and hence lekking) happens to be lower on the necessarily limited number of survey visits due to other factors such as weather, then the results from the survey might not accurately reflect the size of the lek, and could underestimate the number of males present. This would have an effect on the quality of the population assessment achieved from the count surveys. Further work on this issue is clearly needed.

Capercaillie call activity is related to temporal and environmental variables

The age and sex biases of lek counts are well recognized (Storch 1997; Mollet et al. 2015). In addition, there are a number of temporal and environmental conditions that can affect detectability and occupancy at leks (Walsh et al. 2004; Raynor et al. 2017; Fremgen et al. 2018). An understanding of these factors is critical to improving population estimates for the species being monitored (Drummer et al. 2011; Sadoti et al. 2016; Priyadarshani et al. 2018). Within this study, the levels of vocal activity were affected by date, time and altitude, but (perhaps surprisingly) no relationship with weather variables was found.

The number of CP phrase segments recorded on a daily basis, totalled across all sites, varied by a factor of nearly 4, but with much greater variation within sites. There were differences between the CP call index and number of CP segments, with the latter showing the highest levels of call activity at the end of the survey period, during May. The median dates for CP call index ranged between 22 April and 5 May, and a daily peak was recorded at 0.5–1 h before sunrise, at approximately 0500 hours (similar to Abrahams and Denny 2018). These key periods of activity confirm the recommendations of Haysom (2013) that Capercaillie lek surveys in Scotland should take place from mid-April to early May, and between 0400 and 0600 hours. As the numbers of birds present at the lek will vary depending on time of day and season, the programming of surveys is of high importance. For Black Grouse, Cayford and Walker (1991) found that the highest counts were obtained in April and early May, 1 h either side of sunrise, and also that counts varied by as much as 80% depending on the time and month they were taken. Bioacoustic monitoring therefore has great potential to confirm when best to conduct lek counts, or to allow inter-calibration of counts conducted at different dates across a season.

Lek attendance is likely to vary by latitude and elevation for a range of species (Sadoti et al. 2016). The negative relationship with altitude found in this study appears to be driven both by the number of male birds present and their levels of vocal activity, with more birds calling more frequently at lower altitudes. This finding appears to contrast with research from other parts of Europe, which has found that Capercaillie prefer elevated sites, such as ridgelines (Rolstad and Wegge 1987; Saniga 2002), although Haysom (2013) notes that, in Scotland, leks usually occur on raised areas but not on the tops of hills. It is possible that the more populated leks here tend to occur in lower lying areas where environmental conditions are more benign. It is known that egg-laying becomes later with increasing altitude, so this variable does have an impact on life history. However, the leks studied here were in a relatively narrow altitude band, and it is possible that the relationship identified would not hold across a wider range of altitude.

Abrahams and Denny (2018), Drummer et al. (2011) and Sadoti et al. (2016) have all found an inverse relationship between lekking activity and wind speed. This may be related to noise sensitivity, as noted in the Greater Prairie Chicken Tympanuchus cupido (Walsh et al. 2015), so the use of detailed weather information from lekking areas is an important avenue for future study to better understand effects on lek attendance and vocal activity patterns. In addition, masking by wind noise will also affect the quantity and quality of bird vocalizations recorded by bioacoustic methods, so this should be taken into account when analysing and interpreting audio data. Despite the daily variation identified here in vocal activity, many studies often use single survey counts to assess lek attendance and population numbers. However, this cannot be recommended, as lek attendance and activity levels may be suppressed even when weather conditions are within the recommended ranges for surveys. As a result, multiple visits may be required to gain accurate lek count numbers, to counteract variations in detectability and occupancy; this should be incorporated into survey guidance (Cayford and Walker 1991; Sadoti et al. 2016).

Conclusion

The continued precarious state of the Capercaillie in Scotland means that effective monitoring at both national and local levels is required. Large-scale surveys can track overall population trends and changes in distribution, but can not effectively inform assessments of localized populations or conservation management actions. Recent developments in national survey methods (Wilkinson et al. 2018) have improved the accuracy of winter transect data. The same effort is now required to refine lek monitoring techniques. The recommendation by Wilkinson et al. (2018) that greater consistency should be developed in lek count methods is echoed here, to allow a better understanding of how lek counts relate to actual numbers of birds and how numbers vary across the season.

The deployment of the acoustic recorders in this study successfully recorded Capercaillie vocal activity at a number of lek sites, providing comparable, simultaneous and long-term data from each location. This has provided new understanding of lekking activity levels over time, has indicated a potential weakness in human lek count methods, and has raised the possibility of using bioacoustics for efficient and effective population monitoring. There are limitations to the bioacoustics approach in recording only audio data, with issues over quiet individuals/species, range of detection, the spacing of recorders, etc. However, the collection of valuable biological data, over long time periods and across large, difficult to access, areas, is a recognized benefit of the bioacoustics approach, enabling reliable estimates of species occurrence and, potentially, abundance (Swiston and Mennill 2009; Blumstein et al. 2011; Furnas and Callas 2015; Gasc et al. 2017; Shonfield and Bayne 2017). Longer-term surveys using acoustic methods could provide an alternative data source where lek counts are not possible, or potentially allow calibration between lek counts undertaken at different sites on different days (Sadoti et al. 2016). This would help to address variation in lek attendance through the season (Sadoti et al. 2016; Fremgen et al. 2018). In addition, bioacoustics offers a non-intrusive survey method in comparison to other techniques such as lek counts, GPS/radio tagging and use of dogs. The benefits of this when dealing with small, declining and disturbance-sensitive populations at a critical stage of their life history should not be underestimated.

In the future, the use of multiple microphone arrays could allow the location of individual male birds to be plotted across a site, to develop a map of distribution and activity areas across a lek (Mennill et al. 2012). This could potentially be combined with individual call recognition and genetic capture-recapture techniques to allow highly detailed assessments of Capercaillie ecology and conservation status (Jacob et al. 2010). There is an increasing body of scientific studies on avian bioacoustics (Gasc et al. 2017; Shonfield and Bayne 2017), but standard and consistently applied guidance is still lacking for conservation managers (Browning et al. 2017; Abrahams 2018). However, as a first step, the relatively simple techniques demonstrated here should be developed and implemented to gather valuable new types of data that will help inform the conservation efforts for this iconic species.